A STRUCTURAL APPROACH TO SUBSET-SUM PROBLEMS 
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Abstract. We discuss a structural approach to subset-sum problems in ad- 
ditive combinatorics. The core of this approach are Freiman-type structural 
theorems, many of which will be presented through the paper. These results 
have applications in various areas, such as number theory, combinatorics and 
^"^ , mathematical physics. 
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LJ ■ 1- Introduction 

C^ . Let A — {fli, a2, ■ ■ ■} be a subset of an additive group G (all groups discussed in 

this paper will be abelian). Let Sa be the collection of subset sums of A 

Sa:^{^x\BcA,\B\ <oo}. 
xeB 

^^ '_ Two related notions that are frequently considered are 

^' 

o 

oo 

O 



lA := {ai H + a(|aj € A} 



j^ ■ We have the trivial relations 



rA := {ai H h at\a^ e A,i ^ j}. 



I* A c lA and U, I* A = Sa- 
One can have similar definitions for A being a sequence (repetitions allowed). 
Example. 

A ^ {0,1,4}, G^Z,2A = {0,1,2,4,5,8}, 2* A = {1,4,5}, Sa = {0,1,4,5}. 
A = {0, 1, 4}, G = Zs, 2A = G, 2* A = {0, 1, 4} = Sa- 
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Now let A be a sequence: 

A = {1,1,9},G^Z, 3A = {3, 11,19,27},3M = {11}, 5*^ = {1,1,2,9,10,11}. 

Notice that for a large I, lA can be significantly different from Sa and I* A. In 
general, it is easier to handle than the later two. 

Many basic problems in additive combinatorics have the following form: 

If A is sufficiently dense in G, then Sa (or I* A or I A) contains a special element 
(such as or a square), or a large structure (such as a long arithmetic progression 
G itself). 

The main question is to find the threshold for "dense" . As examples, we present 
below a few well-known results/problems in the area. In the whole paper, we are 
going to focus mostly on two special cases: (1) G = Zp, where Zp denotes the cyclic 
group of residues modulo a large prime p; (2) G = Z, the set of integers. 

Following the literature, we say that A is zero-sum-free if ^ Sa- Furthermore, A 
is complete if Sa = G and incomplete otherwise. The asymptotic notation is used 
under the assumption that |^| -^ oo. 

A basic result concerning zcro-sum-free sets is the following theorem of Olson [53] 
and Szemcredi [60] from the late 1960s, addressing a problem of Erdos and Heil- 
bronn [23]. 

Theorem 1.1. (Olson- Szemcredi) Let A he a subset of Zp with cardinality C^fp, 
for a sufficiently large constant C . Then Sa contains zero. 

To see that order ^ is necessary, consider A := {1,2,. ..,n}, where n « ^/2p is 
the largest integer such that 1 + • • • + n < p. 

Concerning completeness, Olson [52], proved the following result 



Theorem 1.2. (Olson) Any subset A of Zp with cardinality at least \/Ap — 3 + 1 
is complete. 

To see that the bound is close to optimal, take A := {— m, . . . , — 1, 0, 1, . . . , to} 
where m is the largest integer such that 1 + • • • + tti < [p/2j . 

Another classical result concerning zero sums is that of Erdos-Ginburg-Ziv [42], 
again from the 1960s. 

Theorem 1.3. (Erdos-Ginburg-Ziv) If A is a sequence of 2p — 1 elements in Zp, 
then p* A contains zero. 

This theorem is sharp by the following example: A = {0^p~^\ i[p-i]} Furthermore, 
instead of and 1, one can use any two different elements of Zp. (Here and later 
x'- ' means x appears with multiplicity k in A.) 
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Now we discuss two problems involving integers. Set [n] :— {1, 2, . . . , n}. An old 
and popular conjecture concerning subset sums of integers is Folkman's conjecture, 
made in 1966 [25]. Folkman's conjecture is a strengthening of a conjecture by Erdos 
[20] about finding a necessary and sufficient condition for a sequence A such that 
Sa contains all but finite exception of the positive integers. 

Conjecture 1.4. (Folkman's conjecture) The following holds for any sufficiently 
large constant C. Let A be an strictly increasing sequence of positive integers with 
(asymptotic) density at least C^pn (namely, \AC\ \n\\ > C^fn for all sufficiently 
large n). Then Sa contains an infinite arithmetic progression. 

Cassels [10] and Erdos [20] showed that density -^/n is indeed needed; thus, Folk- 
man's conjecture is sharp up to the value of C. For more discussion about Folk- 
man's conjecture and its relation with Erdos' conjecture, we refer to [25] and the 
monograph [21] by Erdos and Graham. 

Finally, a problem involving a non-linear relation, posed by Erdos in 1986 [19]. 

Problem 1.5. (Erdos' square-sum-free problem) A set A of integers is square- 
sum-free if Sa does not contain a square. Find the largest size of a square-sum-free 
subset of [n] . 

Erdos observed that one can construct such a square-sum-free subset of [n] with 
at least Q{n^^^) elements. To see this, consider A := {q, 2q, . . . , kq} with q prime, 
{k + l)k < 2q, kq < n. Since the sum of all elements of A is less than g^, Sa does 
not contain a square. Erdos [19] conjectured that the truth is close to this lower 
bound. 

Problems involving subset sums such as the above (and many others) have been 
attacked, with considerable success, using various techniques: combinatorial, har- 
monic analysis, algebraic etc. The reader who is interested in these techniques may 
want to look at [3, 57, 64, 48] and the references therein. 

The goal of this paper is to introduce the so-called "structural approach" , which 
has been developed systematically in recent years. This approach is based on the 
following simple plan 

Step I: Force a structure on A. In this step, one tries to show the following: If A is 
relatively dense (close to the conjectured threshold but not yet there) and Sa does 
not contain the desired object, then A has a very special structure. 

Alternately, one can can try to 

Step /'.■ Find a structure in Sa- HA is relatively dense (again close to the conjec- 
tured threshold but not yet there) then Sa contains a special structure. 

Step II: Completion. Since \A\ is still below the threshold, we can add (usually 
many) new elements to A. Using these elements together with the existing struc- 
ture, one can, in most cases, obtain the desired object in a relatively simple manner. 
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The success of the method depends on the quaUty of the information we can obtain 
on the structure of A (or Sa) in Step I (or V). In several recent studies, it has turned 
out that one can frequently obtain something close to a complete characterization 
of these sets. Thanks to these results, one is able to make considerable progresses 
on many old problems and also reprove and strengthen several existing ones (with a 
better understanding and a complete classification of the extremal constructions). 

The rest of this paper is devoted to the presentation of these structural theorems 
and their representative applications. 



2. Freiman's structural theorem 

A corner stone in additive combinatorics is the structural theorem of Freiman (some- 
time referred to as Freiman's inverse theorem), which writes down the structure of 
sets with small doubling. 

A generalized arithmetic progression (GAP) of rank rf in a group G is a set of the 
form 



{ao + aixi H h adXdlM^ < Xi < TVJ, 

where a^ are elements of G and Mi < Ni are integers. It is intuitive to view a GAP 
Q as the image of the d-dimensional integral box B :~ {{xi, . . . ,Xd)\Mi < Xi < Ni} 
under the linear map 



^{xi, . . .,Xd) = aQ + aixi H V adXd- 



We say that Q is proper if $ is one-to-one. It is easy to see that if Q is a proper 
GAP of rank d and A is a, subset of density 5 of Q, then \2A\ < C{d, 6)\A\. Indeed, 



\2A\ < \2Q\ < \2B\ = 2''\B\ = 2'^|Q| < ^\A\ 



since the volume of a box increases by a factor 2*^ if its sizes are doubled. 

Freiman's theorem shows that this is the only construction of sets with constant 
doubling. 

Theorem 2.1. (Freiman's theorem) [27] For any positive constant C , there are 
positive constants d — d{C) ad 5 — S{C) such that the following holds. Let A be 
a finite subset of a torsion-free group G such that \2A\ < C\A\. Then there is a 
proper GAP Q of dimension d such that Ad Q and \A\ > d\Q\. 
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Frciman theorem has been extended recently to the torsion case by Green and 
Ruzsa [35]. [64, Chapter 5] contains a detailed discussion of both theorems and 
related results. 

One can use Freiman's theorem iteratively to treat the sumset lA , for / > 2. For 
simplicity, assume that Z = 2" is a power of 2. Thus, the set Ag := lA = 2^ A can 
be viewed as 2As^i where Ag-i := 2*~^A. Using a multi-scale analysis combined 
with Fremain's theorem, one can obtain useful structural information about I A or 
A itself. For an example of this technique, we refer to [61] or [64, Chapter 12]. 

The treatment of I* A and Sa is more difficult. However, one can still develop 
structural theorems in these cases. While the content of most theorems in this 
direction are quite different from that of Freiman's, they do bear a similar spirit 
that somehow the most natural construction happens to be (essentially) the only 



3. Structure of zero-sum-free sets 

Let A be a zero-sum- free subset of Zp. We recall the example following Theorem 1.1. 
Let A := {1,2, . . . , n}. If 1 + - • • + n < p, then obviously Sa does not contain 0. This 
shows that a zero-sum- free set can have close to y/2p elements. In [61], Szemeredi 
and Vu showed that having elements with small sum is essentially the only reason 
for a set to be zero-sum-free. More quantitative versions of this statement were 
worked out in [49] and [50]. For example, we have [50, Theorem 2.2] 

Theorem 3.1. After a proper dilation (by some non-zero element), any zero-sum- 
free subset A of Zp has the form 



A = A'UA 

where the elements of A' (viewed as integers between and p -- 1) are small, 
J2xeA'^ <P' and a" is negligible, \A"\ <p6/i3+o(i)^ 

One can perhaps improve the constant 6/13 by tightening the analysis in [50]. It 
is not clear, however, what would be the best constant here. In most applications, 
it suffices to have any constant strictly less than 1/2. 

The dilation is necessary. Notice that if A is zero-sum- free (incomplete) , then the 
set Ax '■— {xa\a G A} is also zero-sum-free (incomplete) for any 7^ x € Zp. 

We can also prove similar results for lA and I* A, and for A being a sequence (see 
[50] for details). In the rest of this section, we present few applications of these 
results. 



3.2. The size of the largest zero-sum- free set in Zp. Let nip denote the size 
of the largest zero-sum- free set in Zp. The problem of determining rup was posed by 
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Erdos and Heilbronn [23] and has a long history. Szemeredi proved that rrip < C^^, 
for some sufficiently large C independent of p [60]. Olson showed that C = 2 suffices 
[53]. Much later, Hamidoune and Zemor [37] showed that nip < y/2p + 5\ogp, 
which is asymptotically sharp. Using an earlier version of Theorem 3.1, Szemeredi, 
Nguyen and Vu [49] recently obtained the exact value of rrip. 

Theorem 3.3. Let Up be the largest integer so that 1 + • • • + (up — 1) < p. 



Ifp ^ ""^"j-^^^ - 1, then mp=np-\. 

zero-sum-free set with Up elements is {—2, 1,3,4,..., Up}. 



Jf p — "p("p — l _ i^ then nip — Up. Furthermore, up to a dilation, the only 



The same result was obtained by Deshouillers and Prakash (personal communica- 
tion by Deshouillers) at about the same time. 

3.4. The structure of relatively large zero-sum-free sets. Let us now con- 
sider the structure of zero-sum- free sets of size close to v^2p. Let ||a;|| denote the 
integer norm of x. In [15], Deshouillers proved 

Theorem 3.5. Let A he a zero-sum-free subset of Zp of size at least ^. Then 
(after a proper dilation) 



J2 \\x/p\\<l + Oip-'/Hogp) 

xeA,x<p/2 

^ \\x/p\\<Oip-'^Hogp). 

xeA,x>p/2 

Deshouillers showed (by a construction) that the error term p^^'"^ cannot be replace 
by o{p~^''^). Using an earlier version of Theorem 3.1, Nguyen, Szemeredi and Vu 
[49] improved Theorem 4.4 to obtain the best possible error term 0(p^^"), under 
a stronger assumption on the size of |^|. 

Theorem 3.6. Let A be a zero-sum-free subset of Zp of size at least .99-\/2p. Then 
(after a proper dilation) 



J2 \\x/p\\<l + 0{p-'/') 

xeA,x<p/2 
xeA,x>p/2 

The constant .99 is, of course, ad-hoc and can be improved by redoing the analysis 
carefully. On the other hand, it is not clear what the best assumption on |^| should 
be. 
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3.7. Erdos-Ginburg-Ziv revisited. Using a version of Theorem 3.1 for sequences, 
Nguyen and Vu [50] obtained the following characterization for a sequence of size 
slightly more than p and does not contain a subsequence of p elements summing 
up to 0. 

Corollary 3.8. [50, Theorem 6.2] Let e be an arbitrary positive constant. Assume 
that A is a p- zero- sum- free sequence and p + p^'^'^^^'^ < \A\ < 2p — 2. Then A 
contains two elements a and b with multiplicities ma, mi, satisfying nia + mi, > 

2(|A|-p-pl2/13+.)^ 

The interesting point here is that the structure kicks in very soon, when A has just 
slightly more than p elements. Few years ago, Gao, Panigrahi, and Thangdurai [43] 
proved a similar statement under the stronger assumption that |^| > 3p/2. 

It is easy to deduce Erdos-Ginburg-Ziv theorem from GoroUary 3.8, together with 
a complete characterization of the extremal sets. The reader may want to consider 
as an exercise or check [501 for details. 



4. Incomplete Sets 

Now we turn our attention to incomplete sets, namely sets A where Sa =/= Zp. The 
situation here is very similar to that with zero-sum-free sets. Szemeredi and Vu 
[61] showed that having elements with small sum is essentially the only reason for 
a set to be incomplete. More quantitative versions of this statement were worked 
out in [49] and [50]. For example, in [50], the following analogue of Theorem 3.1 
was proved 

Theorem 4.1. After a proper dilation (by some non-zero element), any incomplete 
subset A of Zp has the form 



A = A'UA 

where the elements of A' are small (in the integer norm), X]a;eA' ll^/p|l ^ 1 '^"■'^ 
a" is negligible, \A" \ <p6/i3+o(i)^ 

The reader can find similar results for I A and I* A and for A being a sequence in 
[50]. We next discuss some applications of these results. 

4.2. The structure of relatively large incomplete sets. Theorem 4.1 enables 
us to prove results similar to those in the last section for incomplete sets. The 
problem of determining the largest size of an incomplete set in Zp was first consid- 
ered by Erdos and Heilbronn [23] and essentially solved by Olson (Theorem 1.2). 
da Silva and Hamidoune [12] tightened the bound to y/Ap — 7 + 1, which is best 
possible. We are not going to go into these results here, but note that one can 
perhaps obtain a new proof (with a characterization of the extremal sets) using 
Theorem 4.1. 
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Concerning the structure of relatively large incomplete sets, Deshouillers and Freiman 
[17] proved 

Theorem 4.3. Let A he an incomplete subset of Tip of size at least \/2p. Then 
(after a proper dilation) 

^||a;/p||<l + 0(p-V4logp). 

They conjectured that the error term may be replaced by 0{y/p), which would be 
best possible due to a later construction of Deshouillers [16]. 

Using Theorem 4.1, Nguyen and Vu [50] confirmed this conjecture, provided that 
A is sufficiently close to 2^^. 

Theorem 4.4. Let A be an incomplete subset of Zp of size at least 1.99^^- Then 
(after a proper dilation) 



J2\\x/p\\<l + Oi^). 



xeA 

Similar to the constant .99 (in the previous section), the constant 1.99 is ad- hoc 
and can be improved by redoing the analysis carefully. On the other hand, it is not 
clear what the best assumption on \A\ is. 

4.5. The structure of incomplete sequences. Let us now discuss (rather briefly) 
the situation with sequences. The main difference between sets and sequences is 
that a sequence can have elements with high multiplicities. It has turned out that 
when the maximum multiplicity of incomplete sequence A is determined, one can 
obtain strong structural information about A. 

Let 1 < 771 < p be a positive integer and A be an incomplete sequence of Zp with 
maximum multiplicity m. Trying to make A as large as possible, we come up with 
the following example. 



Bm = {-nl'^l, (n - 1)["1, . . . ,-l["l,0["l, l'"', ...An- l)['"l,nW} 
where 1 < k < m and n arc the unique integers satisfying 

2to(1 + 2-\ hn-l) + 2kn <p< 2to(1 + 2 -\ \- n - 1) + 2{k + l)n. 

It is clear that any subsequence of Bm is incomplete and has multiplicity at most 
m. In [50], we proved that any incomplete sequence A with maximum multiplicity 
m and cardinality close to \Bm\ is essentially a subset of Bm. 
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Theorem 4.6. Let 6/13 < a < 1/2 he a constant. Assume that A is an incom- 
plete sequence of Tip with maximum, multiplicity m and cardinality \A\ = \Bm\ — 
0{{pm)°'). Then after a proper dilation, we can have 



A = A'UA 
where A' C S„ and \A" \ = 0((pto)("+i/2)/2). 

4.7. Counting problems. Sometime one would like to count the number of sets 
which forbid certain additive configurations. A well-known example of problems 
of this type is the Cameron-Erdos problem [11], which asked for the number of 
subsets of [n] = {1, 2, . . . , n} which does not contain three different elements x, y, z 
such that X + y — z — 0. Cameron an Erdos noticed that any set of odd numbers 
has this property. Thus, in [n] there are at least il(2"/^) subsets with the required 
property. They conjectured that 2"' ^ is the right order of magnitude. There were 
several partial results [2, 9, 22] before Green settled the conjecture [34]. 

Using structural theorems such as Theorem 3.1, one can obtain results of similar 
spirit for the number of zero-sum-free or incomplete sets and sequences. For exam- 
ple, using an earlier version of Theorem 3.1 and standard facts from the theory of 
partitions [1], Szemercdi and Vu [61] proved 



Corollary 4.8. The number of incomplete subsets of Zp is exp((w ^tt + 0(1))^/^). 

Using Theorem 4.6, one obtains the following generalizations [50]. 

Corollary 4.9. The number of incomplete sequences A with highest multiplicity m 
in Zp is exp((y^(l- ;;^)|7r + o(l))^). 

It is an interesting question to determine the error term o(l). 

5. Incomplete sets in a general abelian group 



Let us now consider the problem of finding the largest size of an incomplete set in 
a general abelian group G, which we denote by In(G') in the rest of this section. 
The situation with a general group G is quite different from that with Zp, due 
to the existence of non-trivial subgroups. It is clear that any such subgroup is 
incomplete. Thus, In(G) > h, where h is the largest non-trivial divisor of [Gl. The 
intuition behind the discussion in this section is that a large incomplete set should 
be essentially contained in a proper subgroup. 

In 1975, Diderrich [13] conjectured that if [Gl — ph, where p > 3 is the smallest 
prime divisor of |G| and h is composite, then c{G) = h + p — 2. (The cases where 
p ~ 2 OT h is a prime is simpler and were treated earlier, some by Diderrich himself 
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[13, 47, 14].) Didderich's conjecture was settled by Gao and Hamidoune in 1999 

[29]. 

The following simple fact explains the appearance of the term p — 2. 

Fact. If Sahh = H for some maximal subgroup H of (prime) index q, then 

\A\<\H\+q-2. 

To verify this fact, notice that A/H is a sequence in the group Zq. It is easy 
to show (exercise) that if B is a sequence of g — 1 non-zero elements in Zg, then 
Sb U{0} = Zq. 

We say that subset A of G is sub-complete if there is a subgroup H of prime index 
such that Sadh = H . 

Once we know that an incomplete set A is sub-complete, we can write down its 
structure completely. There is a subgroup H with prime index q such that |A\_ff | < 
g — 2, and the sequence A/H is incomplete in Zq. (The structure of such a sequence 
was discussed in the previous section.) It is natural to pose the following 

Problem 5.1. Find the threshold for sub- completeness. 

Recently, Gao, Hamidoune, Llado and Serra [30] showed (under some weak assump- 
tion) that any subset of at least -^^h -\- p elements is sub-complete. Furthermore, 
one can choose H to have index p, where p is the smallest prime divisor of \G\. Vu 
[68] showed (again under some weak assumption)that |ft, is sufficient to guarantee 
sub-completeness. It is not clear, however, that what the sharp bound is. 

The situation is much better if we assume that |G| is sufficiently composite. In par- 
ticular, if the product of the two smallest prime divisors of |G| is significantly smaller 
than -^/IGI, then one can determine the sharp threshold for sub-completeness. 

Theorem 5.2. [68] For any positive constant S there is a positive constant D{d) 
such that the following holds. Assume that \G\ ~ pi . . . pt , where t > 3 and pi < 
P2 • • • < Pt are primes such that pip2 < -p7jy\/|G|/ log |G|. Then any incomplete 
subset A of G with cardinality at least {l-\-d)—^^^— is subcomplete. Furthermore, the 
lower bound (1 -I- (5)—^^ cannot be replaced by —^ — \- n^^^^", for any constant a. 

6. Structures in Sa 

As mentioned in the introduction, an alternative way to implement our plan is to 
find a structure in Sa rather than in A (Step I'). A well-known result concerning 
the structure of 5*^ is the following theorem, proved by Freiman [28] and Sarkozy 
[55] independently. 

Theorem 6.1. There are positive constants C and c such that the following holds 
for all sufficiently large n. Let A be a .subset of [n] := {l,...,n} with at least 
C\Jn log n elements. Then Sa contains an arithmetic progression of length c|A|^. 
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It is clear that the bound on the length of the arithmetic progression (AP) is 
optimal, as one can take A to be an interval. The lower bound on \A\, however, 
can be improved to Cy/n, as showed by Szemercdi and Vu [62]. 

Theorem 6.2. There are positive constants C and c such that the following holds 
for all sufficiently large n. Let A he a subset o/ [n] := {1, . . . , n} with at least C^/n 
elements. Then Sa contains an arithmetic progression of length c\A\'^ . 



The assumption |j4| > C^/n is optimal, up to the value of C, as one can construct 
a set A C [n] of e^/n elements, for some small constant £, such that Sa does not 
contain any arithmetic progression of length larger than n"^' "* (see [62] or [63, Section 

3.4]). 

Theorem 6.2 can be extended considerably. Szemercdi and Vu [63] showed that for 
any set A C [n] and any integer I such that I \A\ > n for some constant rf, the 
sumset I* A contains a large proper generalized arithmetic progression (GAP). The 
parameters of this GAP is optimal, up to a constant factor (see [63, Section 3] for 
more details). 

Theorem 6.3. [63, Theorem 7.1] For any fixed positive integer d there are positive 
constants C and c depending on d such that the following holds. For any positive 
integers n and I and any set A C [n] satisfying l'^\A\ > Cn, r*A contains a proper 
GAP of rank d' and volume at least cl'^ \A\, for some integer 1 < d' < d. 



There are variants of Theorem 6.3 for finite fields, and also for sums of different sets 
(see [63, Section 5] and [63, Section 10]). In the following subsections, we discuss 
few applications of Theorems 6.2 and 6.3. 



6.4. Folkman conjectures on infinite arithmetic progressions. Let us recall 
to the conjecture of Folkman, mentioned in the introduction. 

Conjecture 6.5. (Folkman's conjecture) The following holds for any sufficiently 
large constant C. Let A be an strictly increasing sequence of positive integers with 
(asymptotic) density A(n) at least C^pn (namely A(n) := \An [n\\ > Cy^ for all 
sufficiently large n). Then Sa contains an infinite arithmetic progression. 



Folkman [25] showed that the conjecture holds under a stronger assumption that 
A{n) > n^'^"'"'^, where e is an arbitrarily small positive constant. (An earlier result 
of Erdos [20] on a closely related problem can perhaps be adapted to give a weaker 
bound w^^^^'''^.) Hegyvari [45] and Luczak and Schoen [46], independently, re- 
duced the density n^^*^ to C\/nlogn, using Theorem 6.1. 

Using the stronger Theorem 6.2, together with some additional arguments, Sze- 
meredi and Vu [62] proved the full conjecture. 

Theorem 6.6. Conjecture 6.5 holds. 
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In the same paper [25], Folkman also made a related conjecture for increasing, but 
not strictly increasing sequences. Let A{n) now be the number of elements of A 
(counting multiplicities) at most n. 

Conjecture 6.7. (Folkman's second conjecture) The following holds for any suffi- 
ciently large constant C. Let A be an increasing sequence of positive integers with 
such that A{n) > Cn for all sufficiently large n. Then Sa contains an infinite 
arithmetic progression. 

Despite the huge change from ^/n to n in the density bound, this conjecture is 
also sharp [25], and (for some time) appeared more subtle than the first one (see 
a discussion in [21, Chapter 6]). Folkman [25] proved the conjecture under the 
stronger assumption that A(n) > n^'^^. It does not seem that one can obtain the 
analogue of Hegyvari and Luczak-Schoen results due to the lack of a "sequence" 
variant of Theorem 6.1. However, the method in [63] is sufficiently robust to enable 
one to obtain such a variant for the stronger Theorem 6.2. With the help of this 
result, one can settle Conjecture 6.7 

Theorem 6.8. [63, Section 6] Conjecture 6.7 holds. 

The strategy for the proofs of Theorems 6.6 and 6.8 is the following. We first find 
a sufficient condition for a sequence A such that Sa contains an infinite AP. 

We say that an infinite sequence A admits a good partition if it can be partitioned 
into two subsequences A' and A with the following two properties 

• There is a number d such that Sa' contains an arbitrary long arithmetic 
progression with difference d. 

• Let A ^ bi < b2 < b^ < . . . . For any number K , there is an index i{K) 
such that J^j^i bj > bi + K for all i > i{K). 

Lemma 6.9. // a sequence A admits a good partition then Sa contains an infinite 
AP . ' 



The second assumption is easy to satisfied given that A has proper density. Thus, 
the key is the first assumption. The main feature here is that in this assumption, 
we only need to guarantee the existence of long (but finite) APs. So, Theorem 6.2 
and its variants can be used with full power to achieve this goal. 

6.10. Erdos conjecture on square-sum- free sets. In this section, we return 
to Erdos conjecture on square-sum-frce sets, mentioned in the introduction. Let 
SF{n) denote the size of the largest subset A of [n] such that Sa does not contain 
a square (or A is square-sum- free) . Erdos [19] observed that SF{n) = Q{n^/^) and 
conjectured that the truth is close to this lower bound. Since then, there have 
been several attempts on his conjecture. Alon [4] proved that SF{n) — 0{j^^). In 

[40] Lipkin improved the bound to SF{n) — 0{n^^^^^). Later, Alon and Freiman 
[5] obtained another improvement SF{n) ~ 0(n^/'^+^). About fifteen years ago, 
Sarkozy [56] showed SF{n) ~ O(Vnlogn). 
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Let US now address the problem from our structural approach point of view. The- 
orem 6.2 is no longer useful, as we are dealing with sets of size around n^", way 
below the lower bound ^Jn required in this theorem. Fortunately, we have a more 
general result, Theorem 6.3, which enables us to find structures in Sa for any set 
of size n , for any constant 8. In particular, we can deduce from this theorem the 
following corollary. 

Corollary 6.11. There are positive constants C and c such that the following holds 
for all sufficiently large n. Let A be a subset of [n] with cardinality at least Cn^'^ . 
Then Sa contains either an AP of length c\A\'^ or a proper GAP of rank 2 and 
volume c\A\^ . 

Combining this corollary with some number theoretic arguments, Nguyen and the 
author [51] can get close to the conjectured bound. 

Theorem 6.12. There is a constant C such that SF(n) < n^^^log" n. 

We strongly believe that the log term can be removed. Details will appear elsewhere. 

7. Inverse Littlewood-Offord theorems and random matrices 

In this final section, we discuss a problem with a slightly different nature. Let A 
be a sequence of non-zero integers. Now we are going to view Sa as a multi-set of 
2" elements. We denote by Ma be the largest multiplicity in Sa- For example, if 
A = {1, . . . , 1} , then Ma = (l„/2j) = 6(2"/^^). 

The problem of bounding Ma originated from Littlewood and Offord's work on 
random polynomials [41]. In particular, they proved that Ma — 0(2" logn/v^)- 
The log term was removed by Erdos [18], who obtained a sharp bound for Ma- 
Many extensions of this result were obtained by various researcher: Erdos-Moser 
[24], Sarkozy-Szemeredi [58], Katona [38] Kleitman [39], Halasz [44], Griggs et. al. 
[36], Frankl-Fiiredi [26], Stanley [59] etc. Among others, it was showed that the 
bound on Ma keeps improving, if one forbids more and more additive structures 
in A. For example, Erdos and Moser [24] showed that if the elements of A are 
different (i.e., A is a set), then Ma = 0(2" log n/n^^^). In general, the following 
can be deduced from results of [44] (see also [64, Problem 7.2.8]) 

Theorem 7.1. For any fixed integer k there is a constant C such that the following 
holds. Let A ~ {ai, . . . , a„} and R^ be the number of roots of the equation 

^iflii + • • • + £2kai2k — 
with Ei = ±1 and ii,- - - ,i2k G [n]. Then Ma < Cn^'^^^^/'^Rk- 

In [66], Tao and Vu introduced the notion of Inverse Littlewood-Offord theorems. 
The intuition here is that if Ma is large (of order 2"/n'^ for any constant C, say), 
then A should have a very strong structure. 
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The most general example we found with large Ma is the following. Let Q be a 
proper GAP of constant rank d and volume V. If A is a subset of Q, it is easy to 
show that Ma = ^( d/2v ) (in order to see this, view the elements of Sa as random 
sums X]r=i ^i^i where a^ are elements of A and ^^ are iid random variables taking 
values and 1 with probability 1/2). Thus, if the volume of Q is small, then Ma 
is large. 

In [66], Tao and Vu proved the inverse statement, asserting that having A as a 
subset of a small GAP is essentially the only way to guarantee make Ma large. 

Theorem 7.2. [66] For any constant C and e there are constants B and d such 
that the following holds. Let A be a sequence of n elements in a torsion-free group 
G. If Ma > 2"/n'-^ for some constant G, then all hut at most n^^'^ elements of A 
is contained in a proper GAP Q of rank d and cardinality n^ . 



In a more recent paper [67] , the same authors obtained a (near) optimal relationship 
between the parameters C, e, d and B. As a corollary, one can deduce (asymptotic 
versions of) many earlier results, such as Theorem 7.1. (In spirit, this process 
is somewhat similar to the process of using Theorem 3.1 to reprove, say, Erdos- 
Ginburg-Ziv theorem.) 

We would like to conclude this survey with a rather unexpected application. Let us 
leave combinatorial number theory and jump to the (fairy remote) area of mathe- 
matical physics. In the 1950s, Wigner observed and proved his famous semi-circle 
law concerning the limiting distribution of eigenvalues in a symmetric random ma- 
trix [69] . A brother of this law, the so-called circular law for non-. symmetric random 
matrices, has been conjectured, but remains open since that time. 

Conjecture 7.3. (Gircular Law Gonjecture) Let ^ be a random variable with mean 
and variance 1 and Mn be the random matrix whose entries are iid copies of S,. 
Then the limiting distribution of the eigenvalues of -j=Mn converges to the uniform 
distribution on the unit disk. 



Girko [31] and Bai [6] obtained important partial results concerning this conjecture. 
These results and many related results are carefully discussed in the book [7] . There 
has been a series of rapid developments recently by Gotze-Tikhomirov [32, 33], Pan- 
Zhou [54], and Tao-Vu [65]. In particular, Tao and Vu [65] confirmed the conjecture 
under the slightly stronger assumption that the (2 -|- 77)-moment of ^ is bounded, 
for any 77 > 0. 

Theorem 7.4. The Gircular Law holds (with strong convergence) under an addi- 
tional assumption that 

£(1^12+") < (X) 
for some fixed rj > 0. 

The key element of this proof is a variant of Theorem 7.2, which enables us to count 
the number of sequences A with bounded elements such that Ma (more precisely 
a continuous version of it) is large. For details, we refer to [65]. 
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